83 results found.
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Arabic Russian Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
5000 sentences Production Status:
Newly created-finished
Use:
Textual Entailment and Paraphrasing
Paper:
N/A
Documentation:
<Not Specified>
Written
Lexicon,
Language Type:
Multilingual
Languages:
Arabic Czech Dutch Finnish Mandarin Chinese
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License
Size:
<Not Specified> <Not Specified>Production Status:
Existing-updated
Use:
<Not Specified>
Paper:
N/A
Documentation:
<Not Specified>
Written
Tokenizer,
Language Type:
Multilingual
Languages:
Arabic
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Existing-used
Use:
Language Modelling
Paper:
N/A
Documentation:
<Not Specified>
Written
Tagger/Parser,
Language Type:
Multilingual
Languages:
Arabic Egyptian Arabic
Availability:
From Owner
License:
non-commercial, research only
Size:
91 MByte Production Status:
Existing-used
Use:
Morphological Analysis
Paper:
N/A
Documentation:
cf. Pasha et al. (2014) http://www.lrec-conf.org/proceedings/lrec2014/pdf/593_Paper.pdfLanguage Type:
Trilingual
Languages:
Arabic English french
Availability:
From Owner
License:
<Not Specified>
Size:
9000000 tokens Production Status:
Newly created-in progress
Use:
Corpus Creation/Annotation
Paper:
N/A
Documentation:
<Not Specified>
Written
Ontology,
Language Type:
Monolingual
Languages:
Arabic
Availability:
Freely Available
License:
Creative Commons attribution-NonCommercial-ShareAlike 4.0
Size:
247 entries Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Developing an Arabic Infectious Disease Ontology to Include Non-Standard Terminology
-
Paper track:Terminology/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Lama Alsudias | Developing an Arabic Infectious Disease Ontology to Include Non-Standard Terminology | /N |
Documentation:
described in the paper
Written
Treebank,
Language Type:
Monolingual
Languages:
Afrikaans Akkadian Amharic Ancient Greek Arabic Armenian Assyrian Bambara Basque Belarusian Bhojpuri Breton Bulgarian Buryat Cantonese Catalan Chinese Classical Chinese Coptic Croatian Czech Danish Dutch English Erzya Estonian Faroese Finnish French Galician German Gothic Greek Hebrew Hindi Hindi English Hungarian Indonesian Irish Italian Japanese Karelian Kazakh Komi Permyak Komi Zyrian Korean Kurmanji Latin Latvian Lithuanian Livvi Maltese Marathi Mbya Guarani Moksha Naija North Sami Norwegian Old Church Slavonic Old French Old Russian Persian Polish Portuguese Romanian Russian Sanskrit Scottish Gaelic Serbian Skolt Sami Slovak Slovenian Spanish Swedish Swedish Sign Language Swiss German Tagalog Tamil Telugu Thai Turkish Ukrainian Upper Sorbian Urdu Uyghur Vietnamese Warlpiri Welsh Wolof Yoruba
Availability:
Freely Available
License:
Various
Size:
25 million words Production Status:
Existing-updated
Use:
Parsing and Tagging
-
Paper title:Universal Dependencies v2: An Evergrowing Multilingual Treebank Collection
-
Paper track:Written/oral presentation
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Joakim Nivre | Universal Dependencies | /N |
Documentation:
https://universaldependencies.org
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic
Availability:
Freely Available
License:
Size:
3.7 GByte Production Status:
Newly created-finished
Use:
Text Mining
-
Paper title:Time-Aware Word Embeddings for Three Lebanese News Archives
-
Paper track:Written/poster presentation with demo
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Shady Elbassuoni | Time-Aware Word Embeddings for Three Lebanese News Archives | /N |
Documentation:
Available in English
Written
Corpus,
Language Type:
Monolingual
Languages:
Arabic Chinese Czech English Finnish French German Hindi Indonesian Italian Japanese Korean Polish Portuguese Russian Spanish Swedish Thai Turkish
Availability:
Freely Available
License:
CC-BY-SA
Size:
300 KByte Production Status:
Newly created-finished
Use:
Emotion Recognition/Generation
-
Paper title:How Universal are Universal Dependencies? Exploiting Syntax for Multilingual Clause-level Sentiment Detection
-
Paper track:Written/poster presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Hiroshi Kanayama | Parallel Sentiment | /N |
Documentation:
For 19 languages (ar,cs,de,en,es,fi,fr,hi,id,it,ja,ko,pl,pt,ru,sv,th,tr,zh)
Written
Evaluation Data,
Language Type:
Multilingual
Languages:
Arabic Chinese English
Availability:
From Data Center(s)
License:
LDC
Size:
303833 words Production Status:
Existing-used
Use:
Information Extraction, Information Retrieval
-
Paper title:Towards Few-Shot Event Mention Retrieval: An Evaluation Framework and A Siamese Network Approach
-
Paper track:Evaluation/oral presentation
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Main Contact | Bonan Min | ACE (Automatic Content Extraction) 2005 Corpus | /N |
Documentation:
Yes. English. Yes.




